MPI-LIT: a literature-curated dataset of microbial binary protein-protein interactions
Abstract
Prokaryotic protein-protein interactions are underrepresented in currently available databases. Here, we describe a 'gold standard' dataset (MPI-LIT) focusing on microbial binary protein-protein interactions and the associated experimental evidence, which we manually curated from 813 abstracts and full texts selected from an initial set of 36,852 abstracts. The MPI-LIT dataset comprises 1237 experimental descriptions covering a non-redundant set of 746 interactions, of which 659 (88%) are not reported in public databases. To estimate the curation quality, we compared our dataset with a union of microbial interaction data from IntAct, DIP, BIND and MINT. Among common abstracts, we achieve a sensitivity of up to 66% for interactions and 75% for experimental methods. Compared with these other datasets, MPI-LIT has the lowest fraction of interaction experiments per abstract (0.9) and the highest coverage of strains (92) and scientific articles (813). We also compared the methods implemented in the STRING database that evaluate functional interactions among proteins (such as genomic context or co-expression). Most of these methods discriminate well between functionally relevant protein interactions (MPI-LIT) and high-throughput data.
Availability: http://www.jcvi.org/mpidb/interaction.php?dbsource=MPI-LIT
Supplementary information: Supplementary data are available at Bioinformatics online.
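The abstract does not spell out how sensitivity was computed. A minimal sketch, assuming it is the fraction of interactions in a reference set (drawn from abstracts shared by both datasets) that the curated set also contains, and assuming interactions are unordered protein pairs. The function names and the toy identifiers are hypothetical and not part of MPI-LIT.

```python
def normalize(pair):
    """Represent a binary interaction as an unordered pair of protein IDs."""
    a, b = pair
    return frozenset((a, b))


def sensitivity(curated, reference):
    """Fraction of reference interactions that also appear in the curated set.

    Assumption: both inputs were extracted from the same (common) abstracts.
    """
    ref = {normalize(p) for p in reference}
    cur = {normalize(p) for p in curated}
    if not ref:
        raise ValueError("reference set is empty")
    return len(ref & cur) / len(ref)


# Toy data (hypothetical identifiers), only to show the call.
reference = [("A", "B"), ("C", "D"), ("E", "F")]
curated = [("B", "A"), ("C", "D"), ("X", "Y")]
print(f"{sensitivity(curated, reference):.2f}")
```

Using `frozenset` makes `("A", "B")` and `("B", "A")` count as the same interaction, which matches the idea of a non-redundant set of binary interactions.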
Journal: Bioinformatics
Volume 24, Issue 22
Publication date: 2008